Separating Features from Noise with Persistence and Statistics
نویسنده
چکیده
In this thesis, we explore techniques in statistics and persistent homology, which detect features among data sets such as graphs, triangulations and point cloud. We accompany our theorems with algorithms and experiments, to demonstrate their effectiveness in practice. We start with the derivation of graph scan statistics, a measure useful to assess the statistical significance of a subgraph in terms of edge density. We cluster graphs into densely-connected subgraphs based on this measure. We give algorithms for finding such clusterings and experiment on real-world data. We next study statistics on persistence, for piecewise-linear functions defined on the triangulations of topological spaces. We derive persistence pairing probabilities among vertices in the triangulation. We also provide upper bounds for total persistence in expectation. We continue by examining the elevation function defined on the triangulation of a surface. Its local maxima obtained by persistence pairing are useful in describing features of the triangulations of protein surfaces. We describe an algorithm to compute these local maxima, with a run-time ten-thousand times faster in practice than previous method. We connect such improvement with the total Gaussian curvature of the surfaces. Finally, we study a stratification learning problem: given a point cloud sampled from a stratified space, which points belong to the same strata, at a given scale level? We assess the local structure of a point in relation to its neighbors using kernel and cokernel persistent homology. We prove the effectiveness of such assessment through several inference theorems, under the assumption of dense sample. The topological inference theorem relates the sample density with the homological feature size. The probabilistic inference theorem provides sample estimates to assess the local structure with confidence. We describe an algorithm that computes the kernel and cokernel persistence diagrams and prove its correctness. We further experiment on simple synthetic data.
منابع مشابه
Separating Topological Noise from Features Using Persistent Entropy
Persistent homology appears as a fundamental tool in Topological Data Analysis. It studies the evolution of k−dimensional holes along a sequence of simplicial complexes (i.e. a filtration). The set of intervals representing birth and death times of k−dimensional holes along such sequence is called the persistence barcode. k−dimensional holes with short lifetimes are informally considered to be ...
متن کاملOn the Bootstrap for Persistence Diagrams and Landscapes
Persistent homology probes topological properties from point clouds and functions. By looking at multiple scales simultaneously, one can record the births and deaths of topological features as the scale varies. In this paper we use a statistical technique, the empirical bootstrap, to separate topological signal from topological noise. In particular, we derive confidence sets for persistence dia...
متن کاملThe effects of traffic noise on memory and auditory-verbal learning in Persian language children
Background: Acoustic noise is one of the universal pollutants of modern society. Although the high level of noise adverse effects on human hearing has been known for many years, non-auditory effects of noise such as effects on cognition, learning, memory and reading, especially on children, have been less considered. Factors which have negative impact on these features can also have a negat...
متن کاملمفهوم ماندگاری در معماری اسلامی و مقایسهی آن با مفهوم پایداری در معماری معاصر
The present paper is extracted from a purposeful research aiming at pondering on the recondite meaning of persistence in Iranian architecture during Islamic era and comparing it to the concept of sustainability and contemporary sustainable architecture. It seems that the literature is poor in terms of causes and factors of persistence in Iranian architecture. The limited literature in this area...
متن کاملStudy of Noise Loudness and Sharpness Effects on Cognitive Performance on Male Students of Tehran University of Medical Sciences
Introduction: The noise could affect some aspects of human health, including the cognitive performance. In addition to sound pressure level and exposure time, the psychoacoustic features of noise may cause destructive effects on humans. A few recent studies have been conducted on effect of sound quality on cognitive performance. This study aims to find the noise loudness and sharpness levels as...
متن کامل